Two-temperature logistic regression based on the Tsallis divergence

نویسندگان

  • Ehsan Amid
  • Manfred K. Warmuth
چکیده

We develop a variant of multiclass logistic regression that achieves three properties: i) We minimize a non-convex surrogate loss which makes the method robust to outliers, ii) our method allows transitioning between non-convex and convex losses by the choice of the parameters, iii) the surrogate loss is Bayes consistent, even in the non-convex case. The algorithm has one weight vector per class and the surrogate loss is a function of the linear activations (one per class). The surrogate loss of an example with linear activation vector a and class c has the form − logt1 expt2(ac −Gt2(a)) where the two temperatures t1 and t2 “temper” the log and exp, respectively, and Gt2 is a generalization of the log-partition function. We motivate this loss using the Tsallis divergence. As the temperature of the logarithm becomes smaller than the temperature of the exponential, the surrogate loss becomes “more quasi convex”. Various tunings of the temperatures recover previous methods and tuning the degree of non-convexity is crucial in the experiments. The choice t1 < 1 and t2 > 1 performs best experimentally. We explain this by showing that t1 < 1 caps the surrogate loss and t2 > 1 makes the predictive distribution have a heavy tail.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

"Divergence problem" in estimating temperature based on tree rings (Case study: Juniper mountainous habitats in northern Kerman province)

Introduction The study of tree rings is one of the most widely used methods of climate reconstruction for centuries and millennia, but the occurrence of climatic anomalies such as global warming in recent decades has caused divergence problem in the series of tree rings in some areas. Which challenges the ability of this proxy to regenerate the climate. The “divergence problem” is the differen...

متن کامل

Estimation of Cardinal Temperatures for Tomato (Solanum lycopersicom) Seed Germination Using Nonlinear Regression Models

Extended Abstract Introduction: Seed germination is one of the most important factors which determine the success of failure of crop establishment. In the absence of other environmental limiting factors such as moisture, temperature would determine the rate and overall seed germination. This research was conducted to investigate the effect of temperature regimes on seed germination, quantify t...

متن کامل

Penalized Bregman Divergence Estimation via Coordinate Descent

Variable selection via penalized estimation is appealing for dimension reduction. For penalized linear regression, Efron, et al. (2004) introduced the LARS algorithm. Recently, the coordinate descent (CD) algorithm was developed by Friedman, et al. (2007) for penalized linear regression and penalized logistic regression and was shown to gain computational superiority. This paper explores...

متن کامل

Non-Extensive Entropic Distance Based on Diffusion: Restrictions on Parameters in Entropy Formulae

Based on a diffusion-like master equation we propose a formula using the Bregman divergence for measuring entropic distance in terms of different non-extensive entropy expressions. We obtain the non-extensivity parameter range for a universal approach to the stationary distribution by simple diffusive dynamics for the Tsallis and the Kaniadakis entropies, for the Hanel–Thurner generalization, a...

متن کامل

Α-divergence Derived as the Generalized Rate Function in a Power-law System

The generalized binomial distribution in Tsallis statistics (power-law system) is explicitly formulated from the precise q-Stirling’s formula. The α-divergence (or q-divergence) is uniquely derived from the generalized binomial distribution in the sense that when α → −1 (i.e., q → 1) it recovers KL divergence obtained from the standard binomial distribution. Based on these combinatorial conside...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1705.07210  شماره 

صفحات  -

تاریخ انتشار 2017